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Abstract 

In cloud radio access networks (C-RANs), the baseband units and radio units of base stations 
are separated, which requires high-capacity fronthaul links connecting both parts. In this paper, we 
consider the delay-aware fronthaul allocation problem for C-RANs. The stochastic optimization problem 
is formulated as an infinite horizon average cost Markov decision process. To deal with the curse of 
dimensionality, we derive a closed-form approximate priority function and the associated error bound 
using perturbation analysis. Based on the closed-form approximate priority function, we propose a low- 
complexity delay-aware fronthaul allocation algorithm solving the per-stage optimization problem. The 
proposed solution is further shown to be asymptotically optimal for sufficiently small cross link path 
gains. Finally, the proposed fronthaul allocation algorithm is compared with various baselines through 
simulations, and it is shown that significant performance gain can be achieved. 
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I. Introduction 

The cloud radio access network (C-RAN) [Q]] provides a new architecture for 5G cellular 
systems. In C-RANs, the baseband processing of base stations is carried out in the cloud, i.e., 
a centralized base band unit (BBU), which launches joint signal processing with coordinated 
multi-point transmission (CoMP) and makes it possible to mitigate inter-cell interference. The 
separation of the BBU and the radio units (RUs) brings a new segment, i.e., fronthaul links, to 
connect both parts. The limited capacities of fronthaul links have a significant influence on the 
system performance of C-RANs. 

There are several existing works on fronthaul links in C-RANs. Efficient signal quantiza¬ 
tion/compression for fronthaul links is designed to maximize the network throughput for the 
uplink and downlink in [[2j and (3]|, respectively. In [4|, fronthaul quantization and transmit 
power control are optimized jointly. In f5}, energy-efficient CoMP is designed for downlink 
transmission considering fronthaul capacity. In [|6|, the capacities of fronthaul links are allocated 
under a sum capacity constraint to maximize the total throughput. In Q, the fronthaul links 
are reconfigured to apply appropriate transmission strategies in different parts according to both 
heterogeneous user profiles and dynamic traffic load patterns. However, these existing works have 
all focused on the physical layer performance without consideration of bursty data arrivals at 
the transmitters or of the delay requirement of the information flows. Since real-life applications 
(such as video streaming, web browsing or VoIP) are delay-sensitive, it is important to optimize 
the delay performance of C-RANs. 

To take the queueing delay into consideration, the fronthaul allocation policy should be a 
function of both the channel state information (CSI) and the queue state information (QSI). This 
is because the CSI reveals the instantaneous transmission opportunities at the physical layer and 
the QSI reveals the urgency of the data flows. However, the associated optimization problem 
is very challenging. A systematic approach to the delay-aware optimization problem is through 
a Markov Decision Process (MDP). In general, the optimal control policy can be obtained by 
solving the well-known Bellman equation. Conventional solutions to the Bellman equation, such 
as brute-force value iteration or policy iteration [j8], have huge complexity (i.e., the curse of 
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dimensionality), because solving the Bellman equation involves solving an exponentially large 
system of non-linear equations. 

In this paper, we focus on minimizing the average delay by fronthaul allocation. There are 
two technical challenges associated with the fronthaul allocation optimization problem: 

. Challenges due to the Average Delay Consideration: Unlike other works which optimize 
the physical layer throughput, the optimization involving average delay performance is 
fundamentally challenging. This is because the associated problem belongs to the class of 
stochastic optimization 0, which embraces both information theory (to model the physical 
layer dynamics) and queueing theory (to model the queue dynamics). A key obstacle to 
solving the associated Bellman equation is to obtain the priority function, and there is no 
easy and systematic solution in general [|8]|. 

. Challenges due to the Coupled Queue Dynamics: The queues of data flows are coupled 
together due to the mutual interference. The associated stochastic optimization problem is a 
fT-dimensional MDP, where K is the number of data flows. This K -dimensional MDP leads 
to the curse of dimensionality with complexity exponential to K for solving the associated 
Bellman equation. It is highly nontrivial to obtain a low complexity solution for dynamic 
fronthaul allocation in C-RANs. 

In this paper, we model the fronthaul allocation problem as an infinite horizon average cost 
MDP and propose a low-complexity delay-aware fronthaul allocation algorithm. To overcome 
the aforementioned technical challenges, we exploit the specific problem structure that the cross 
link path gain is usually weaker than the home cell path gain. Utilizing the perturbation analysis 
technique, we obtain a closed-form approximate priority function and the associated error 
bound. Based on that, we obtain a low-complexity delay-aware fronthaul allocation algorithm. 
The solution is shown to be asymptotically optimal for sufficiently small cross link path 
gains. Furthermore, the simulation results show that the proposed fronthaul allocation achieves 
significant delay performance gain over various baseline schemes. 

The rest of this paper is organized as follows. In Section II, we establish the wireless access 
link, fronthaul link and cloud baseband processing models as well as the queue dynamics. In 
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Figure 1. C-RAN topology 


Section III, we formulate the fronthaul allocation problem and derive the associated optimality 
conditions. In Section IV, we propose a low-complexity fronthaul allocation solution. Following 
this, the delay performance of the proposed algorithm is evaluated by simulation in Section V. 
Finally, conclusions are drawn in Section VI. 

II. System Model 

In this section, we introduce the C-RAN topology and the associated models of the access 
link, the fronthaul link and the cloud baseband processing. Based on the models, we obtain the 
throughput and the dynamics of packet queues. 

A. C-RAN Topology 

We consider a C-RAN with K cells, each of which has an RU with a single antenna. In each 
cell, the data are transmitted from a single-antenna user equipment (UE) to the RU via wireless 
access links and then to the BBU via the fronthaul link over fiber/microwave, as shown in Fig. 

m 

The time is slotted and the duration of each time slot is r. The BBU collects necessary 
information and makes the resource allocation decisions periodically at the beginning of each 
time slot. 
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B. Wireless Access Link Model 

The wireless access links are modeled as an interference channel. In the uplink, the UEs 
transmit signals to their corresponding RUs respectively, and in the meantime, cause interference 
to other RUs in the network. The signals received by the RUs are 

y = Hx + z, (1) 

where x = (xi,X 2 ,--- ,xk) t is a K -dimensional vector of the transmitted signals, in which 
Xk is transmitted by the k- th UE with power P, y = ( 2 / 1 , 2 / 2 , * * ■ ,Vk) t is a K -dimensional 
vector of the signals received by the RUs, in which y k is the signal received by the RU in 
the k- th cell, H = ( Hkj) KxK , in which H k j is the complex channel fading coefficient of the 
uplink transmission from the j-th UE to the RU in the A-th cell, z = (z\ , 2 2 , • • • , zk) t and 
z k ~ CM (0, No) is the white Gaussian thermal noise with power N 0 . 

Define H(t) as the global CSI for uplink access links at the /-th slot. We have the following 
assumption on H(£). 

Assumption 1 (CSI Model): The CSI H(t) remains constant within a time slot and is i.i.d. 
over time slots. H k j ( t ) is independent over the indices k and jQ II kj ( t ) is composed of two parts, 
i.e., H k j ( t ) = sjL k jH kj (t), where H k j(t ) is the short-term fading coefficient which follows a 
complex Gaussian distribution with mean 0 and unit variance, and L k j is the corresponding 
large-scale path gain, which is constant over the duration of the communication session. ■ 

C. Fronthaul Link Model 

Denote C k (t) as the capacity allocated to the fronthaul link between the RU in the A:-th cell 
and the BBU at the t-th slot. Let C(t) = (Ci(£), C 2 (t), • • • , CV(£)) be the uplink fronthaul 
allocation. With limited-capacity fronthaul links, the signals transmitted between the RUs and 
the BBU have to be quantized. In the uplink, the RU in each cell underconverts its received 

fin C-RANs, the simultaneously transmitting UEs using the same resource block are located in different cells. Thus, the 
distances between the RUs and those between the UEs are always large enough to make the channel fading coefficients 
independent. 
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signal and sends the quantized signal to the BBU. Define y = (yj. y 2 . ■ ■ ■ ■ Vk) t , where y k is 
the quantized signal at the RU in the k- th cell. The signals are assumed to be quantized for each 
fronthaul link separately. The quantization leads to the distortion of signal, which can be treated 
as the quantization noise, denoted as n = (ni, ri 2 , • • • , uk) t , where n k is the quantization noise 
over the k- th fronthaul link. The signals received by the BBU are expressed as 

y = y + n. (2) 


The relationship between y k and y k depends on the fronthaul capacity C k according to the 
rate-distortion theory IfTOl . which is given by I (y k : y k ) < C k , where I (y k : y k ) is the mutual 
information between y k and y k . Let N(t) = (Ni(t), N 2 (t), ■ ■ ■ ,N K (t)), where N k (t ) is the 
power of the quantization noise n k at the t-th slot. The quantization noise power induced by the 
transmission over the k -th uplink fronthaul link at the /-th slot is given by []6] 

PT.U II^WII’ + JVo 


where 


N k (t) = 

is the Euclidean norm. 


2 Chit) - l 


(3) 


D. Throughput with Cloud Baseband Processing 

The BBU performs joint decoding for the received uplink signals, which benefits the system 
performance by joint cloud processing of the signals for different cells. The cloud baseband 
processing for uplink signals at the BBU is introduced in the following assumption. 

Assumption 2 (Zero Forcing Joint Detection): Assume that ZF joint detection ifTTl . llT2l is 
adopted for the uplink in the cloud baseband processing to eliminate the inter-cell interference. 
The linear ZF receiver at the BBU can be represented by a matrix S(t) = ( S k j(t)) KxK at the 
t-th slot, where S (t) is the inverses of the channel matrix H(f), i.e., S (t) = Fl(/)^ 1 . ■ 

The uplink transmission model is described in Fig. [2] With the ZF joint detection at the BBU, 
the post-processing signal is 

Sy = x + S(z + q). (4) 


2 According to Assumption Q] the elements of H(f) are independent. Thus, rank(H(f)) = K,Vt and the inverse of H(f) 


exists. 
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Figure 2. Uplink transmission model of a C-RAN 


Considering both the thermal noise power N 0 and the quantization noise power N (t), we 
obtain the uplink data rate for the i-th UE as 

P 


R k (H (t ), C (£)) = log 2 1 + 


Ef=il|5jy(t)|| 2 (JVo + ^-(t)) 


(5) 


where N (t) is a function of H (t) and C (t), and S(t) is a function of H (t). Note that there is 
an implicit coupling among the K uplink data flows in the sense that R k depends not only on 
the fronthaul capacity allocation C k but also on Cj,\/j ^ k. 


E. Queue Dynamics 

There is a bursty data source for each UE. Let A(t) = (Ai(t)r, • • • ,A N (t)r ) be the random 
arrivals (number of bits) from the application layers at the end of the t-th time sloj\ We have 
the following assumption on A(t). 

Assumption 3 (Bursty Source Model): Assume that A k (t) is i.i.d. over slots according to a 
general distribution Pr[A fc ]. The moment generating functions of A k exist with E[A fc ] = X k . 
A k (t) is independent w.r.t. k. Furthermore, the arrival rates A = (Ai, A2, ..., Xk) lie within the 
stability region Ifl4l of the system with the given uplink resource. ■ 

3 We assume that the transmitters are causal so that the packets arrived at the time slot are not observed when the control 


actions of this time slot are performed. 












Each UE has a data queue for the bursty traffic flows towards the associated RU. Let Q k (t) G 
[0, oo) be the queue length (number of bits) at the k -th UE at the beginning of the t-th slot. Let 
Q(t) = ,QN(t )) G Q = [0, oo) K be the global QSI. The queue dynamics for the 

k -th UE can be written as 

Qk(t+ 1) = max {Q k (t) - R k (H(t), C(t))r, 0} + A k (t)r. (6) 

Remark 1 (Coupling Property of Uplink Queue Dynamics): In the uplink, the K queue dy¬ 
namics are coupled together due to the ZF processing in the BBU. Specifically, according to ©, 
the queue departure R k (H(t), C(t)) for the z-th UE depends on not only the allocated capacity 
C k (t) for the k- th fronthaul link, but also all the other elements of C(t). ■ 

III. A Control Framework of Delay-Aware Uplink Fronthaul Allocation 

In this section, we formulate the delay-aware control framework of uplink fronthaul allocation. 
We first define the control policy and the optimization objective. We then formulate the design as 
a Markov Decision Process (MDP) and derive the optimality conditions for solving the problem. 

A. Fronthaul Allocation Policy 

For delay-sensitive applications, it is important to dynamically adapt the fronthaul capacities 
C (t) based on the instantaneous realizations of the CSI (captures the instantaneous transmission 
opportunities) and the QSI (captures the urgency of K data flows). Let x — (H, Q) denote the 
global system state. We define the stationary fronthaul allocation policy below: 

Definition 1 (Stationary Fronthaul Allocation Policy): A stationary control policy for the /, -th 
UE Q k is a mapping from the system state x t0 the fronthaul allocation action of the k- th UE. 
Specifically, Q k (x) = C k > 0. Let il = {Q/, : V/>:} denote the aggregation of the control policies 
for all the K UEs. ■ 

The CSI H is i.i.d. over time slots based on the block fading channel model in Assumption Q] 
Furthermore, from the queue evolution equation in ©, Q (t + 1) depends only on Q(£) and the 
data rate. Given a control policy f2, the data rate at the /-th slot depends on H(f) and Q(x(t)). 


Hence, the global system state %(t) is a controlled Markov chain |8] with the following transition 
probability: 


Pr[x(f + l)|x(f),Sl(x(f))] =Pr[H(f + l)]Pr[Q(f + l)|x(f),^(x(f))] , ( 7 ) 


where the queue transition probability is given by 


Pr[Q(f + l)|x(t),f2(x(t))] 


Y[ k Pr [A k (t) ] if Q k (t + 1) is given by ©, Wk 


( 8 ) 


0 


otherwise, 


where the equality is due to the i.i.d. assumption of H(t) in Assumption Q] 

For technical reasons, we consider the admissible control policy defined below. 

Definition 2 (Admissible Control Policy): A policy il is admissible if the following require¬ 
ments are satisfied: 

• f2 is a unichain policy, i.e., the controlled Markov chain x (t) under f2 has a single recurrent 
class (and possibly some transient states) (8]|. 

• The queueing system under il is second-order stable in the sense that 

lim^oo where means taking expectation w.r.t. the probability 

measure induced by the control policy f2. ■ 

B. Problem Formulation 

As a result, under an admissible control policy f2, the average delay for the k -th data queue 
is given by 



(9) 


Similarly, under an admissible control policy il. the average fronthaul capacity for the k -th data 
queue is given by 



T -1 


(10) 


We formulate the delay-aware fronthaul allocation problem for C-RANs as follows: 
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Problem 1 (Delay-Aware Fronthaul Allocation Problem): The delay-aware fronthaul alloca¬ 
tion problem is formulated as 


min 

f2 


K 


L(n) = Y,(PkD k (n) + y k c k (n) 


k =1 


T—l 


lim sup - [c (Q (t) , O (x (*)))] 


T—> oo 


t =0 


(ii) 


where c(Q,C) = J2k=i +7 kC^j. (3 = {/ 3 k > 0 : \/k} are the positive weights for the 

delay cost and 7 = { 7 ^ > 0 : V/c} are the prices for the data transmission over fronthaul links. 


Note that Problem |Tj is an infinite horizon average cost MDP, which is known as a very 
difficult problem. 


C. Optimality Conditions for Uplink Fronthaul Allocation 


Problem [T]is an MDP and the Bellman equation [jH provides its optimality conditions. The 
Bellman equation involves the entire system state x — (H, Q). Exploiting the i.i.d. property of 
H(f) according to Assumption Q] we obtain the equivalent Bellman equation in the following 
theorem. 

Theorem 1 (Sufficient Conditions for Optimality): For any given weights (3, assume there 
exists a (9*, {V*(Q)}) that solves the following equivalent Bellman equation : 


9*t + V*( Q) =E 


mm 
«(x) L 


:(Q.f!(x))T + y]Pr[Q'| x ,n(x)]V*(Q') 


Q' 


Q 


VQe Q, 


( 12 ) 


Furthermore, for all admissible control policies f2, V* satisfies the following transversality 
condition : 

lim ^E" [V* (Q (T))] = 0. (13) 

T^-oo I 


Then 6* is the optimal average cost, and V* (Q) is the priority function of the K data flows. If 
there exists an admissible stationary policy f2* (x) = C* where C* attains the minimum of the 
R.H.S. of (fl2l) for all Q £ Q, then f2* is the optimal control policy for Problem Q] 

Proof: Please refer to Appendix A. ■ 
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Remark 2 (Interpretation of Theorem U}: The equivalent Bellman equation in (fl2l) is defined 
on the QSI Q only. Nevertheless, the optimal control policy 17* obtained by solving (fl2l) is 
still adaptive to the entire system state X- At each stage, when the queue length is Q(t), the 
optimal action has to strike a balance between the current cost c(Q,17(x)) and the future 
cost Pr [Q'|x, 17 (x)] R*(Q') because the action taken will affect the future evolution of 
Q(t +1). ■ 


IV. Low-Complexity Fronthaul Allocation 

One key obstacle in deriving the optimal fronthaul policy 17* is to obtain the priority function 
V*(Q) of the Bellman equation in (fT2l) . Conventional brute force value iteration or policy 
iteration algorithms can only give numerical solutions and have exponential complexity in K , 
which is highly undesirable. In this section, we shall exploit the characteristics of the topology of 
C-RANs. Specifically, we define 8 = max {L k j : \/k f j} be the worst-case path gain among all 
the cross links, which is usually weaker than the home cell path gain due to the C-RAN network 
architecture. We adopt perturbation theory w.r.t. 8 to obtain a closed-form approximation of the 
priority function V*(Q) and derive the associated error bound. Based on that, we obtain a low 
complexity delay-aware fronthaul allocation algorithm. 


A. Calculus Approach for Solving the Bellman Equation 

We adopt a calculus approach to obtain a closed-form approximate priority function. We first 
have the following theorem for solving the Bellman equation in (fl2l) . 

Theorem 2 (Calculus Approach for Solving t!72l)): Assume there exist c°° and J (Q: 8) of 
class C 2 (U5;) that satisfy 


the following partial differential equation (PDE): 


E 


mm 

n(x) 


L k =1 


+ E 

k =1 


dJ(Q-8) 

dQk 


(A fc -f4(H,C)) 


Q 


= o, 


K 


VQ e 


( 14 ) 


with boundary condition J (0; 5) = 0; 
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• For all k, 9 J qq^ is an increasing function of all Q k \ 

• J (Q; 5) = O (||Q|| 2 ). 

Then, we have 

6* = c°° + o(l), y* (Q) = J (Q; 5) + o(l), VQ e Q, (15) 

where the error term o(l) asymptotically goes to zero for sufficiently small r. 

Proof: Please refer to Appendix B. ■ 

Theorem [2] suggests that if we can solve for the PDE in (fl4l) . then the solution (J (Q; 5), c°°) 
is only o(l) away from the solution of the Bellman equation (1/*(Q), 6*). 


B. Closed-Form Approximate Priority Function via Perturbation Analysis 

The queues of the K uplink data flows are coupled due to the coupling of R k in (©. The 
following lemma establishes the intensity of the queue coupling. 

Lemma 1 (Intensity of the Uplink Queue Coupling): The coupling intensity of uplink data 
queues induced by R k in © is given by ||S'fcj(f)|| 2 = O (5) ,Vk f j. 

Proof: Please refer to Appendix C. ■ 

As a result, the solution of (fl4l) depends on the worst-case cross link interference path gain 5 
and, hence, the A'-dimcnsional PDE in (fl4l) can be regarded as a perturbation of a base system, 
as defined below. 

Definition 3 (Base System): A base system is characterized by the PDE in (fT4l) with 5 = ().■ 

According to Lemma Q] we have ||5V 0J (f)|| 2 = 0,V/c f j in the base system. We first study 
the base system and use J(Q; 0) to obtain a closed-form approximation of J( Q; 5). 

We have the following lemma summarizing the priority function J(Q; 0) of the base system. 

Lemma 2 (Decomposable Structure of J( Q; 0 )): The solution J(Q; 0) for the base system has 
the following decomposable structure: 

K 

J (Q; o) = ^ J k (Qk) > 

k =i 


( 16 ) 
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where J k (Ok) ' s the per-flow priority function for the k -th data flow given by 


A k ( ve 


a k 


QkW [ln2 El 

Jk{v) = 


a k v 


A k 


kj In 2 

fcTfc 


A kV -It-Ek 1 aklk 


na , 0 . Ik ilk - v) + e ak u 2 E 1 

2/4 hi 2 \ \u — 7fc 


^ - 7fc 
a k v 


+ cl 


(17) 


+ (ofe — Afc) 74 


Qfc7fc 
^ - 7fc 


- Afc^ 2 In 2 + b k , 


where a k = j&; c “ = 177^1 (S)> w h| ere 4 satisfies (gg) = A fc ; £i(» = 

/i°° £_ r lf h = f°° gdf; 6 /, is chosen to satisfy^ the boundary condition .//,(()) = 0 . 

Proof: Please refer to Appendix D. ■ 

Note that when 5 = 0, we have L k j = 0 for all k f j and, hence, there is no coupling 
between the UE-RU pairs. As a result, the K data queues are totally decoupled and the system 
is equivalent to a decoupled system with K independent queues. That is why the priority function 
J (Q; 0) in the base system has the decomposable structure in Lemma [2l 
When 5 > 0, J(Q; 5) can be considered as a perturbation of the solution of the base system 
J(Q;0). Using perturbation analysis on the PDE (fl4l) . we establish the following theorem on 
the approximation of 

Theorem 3 (First Order Perturbation of J( Q; 5)): J (Q: 5) is given by 

J(Q;l)=J(Q;0) + ^( L m U t Q\ 4 ^ E W) + 0 (Ql)) + ° 

k=1 \j=ljj^k \ 1=1,l^k ^ 33 J J 

(18) 



where <f> fc = (l - a * ea ^ l( ° fc) ) / (e ak E 1 (a k ) - A fc In 2). 

Proof: Please refer to Appendix E. ■ 

The priority function U(Q) is decomposed into the following three terms: 1) the base term 
J k (Q k ) obtained by solving a base system without coupling, 2) the perturbation term 
accounting for the first order coupling due to the joint processing in the BBU, and 3) the 
residual error term which goes to zero in the order of 0(l/5 2 ). As a result, we adopt the 


4 To find bk, firstly solve Qk(v) = 0 using one-dimensional search techniques (e.g., bisection method). Then bk is chosen 
such that Jkiy) = 0. 
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following closed-form approximation of U(Q): 

K K / K / K K 

r(Q) =£>«*)+ E1 E Er+r E 


(19) 


k =1 


k=l \j=l,j^k 


1=1,tyk 






C. Fronthaul Allocation Algorithm 

In this section, we use the closed-form approximate priority function in (fT9l) to capture the 
urgency information of the K data flows and obtain a low complexity delay-aware fronthaul 
allocation algorithm. Using the approximate priority function in (fl9l) . the per-stage control 
problem (for each state realization x) is given b\|f 


K 


max 

c 


E ( %^Rk(a,c)-^Ck 


k =1 


dQk 


( 20 ) 


where 9 g^ can be calculated from (fl9l) . which is given by 

¥ =Ji (<w + 24 ‘( E E g + E E ^g) ft- < 2 » 

\j=lj^k 1=1,l^k i=l,ij^k j=l,j^i,k ^ / 

The per-stage problem in (l20l) is similar to the weighted sum-rate (WSR) optimization Ifl5l . 
which can be considered as a special case of network utility maximization. However, unlike 
conventional WSR problems, where the weights are static, the weights here in (l20l) are dynamic 


and are determined by the QSI via the priority function 2-LTT. As such, the role of the QSI is 
to dynamically adjust the weight (priority) of the individual flows, whereas the role of the CSI 
is to adjust the priority of the flow based on the transmission opportunity in the rate function 

-Rfe(H, C). 

One approach to solve the WSR problem is solving the local optimization problem for each 
flow iteratively lfT5ll . In each local optimization problem for the k-th flow, the total WSR objective 
is maximized, assuming that the capacities of other links Cj , Vj ^ k do not change. The local 
optimization problem is formulated as 


max 

c k 


K 

E 

k =1 


9V (Q) 

dQk 


i4(H,C)- 7 *A 


( 22 ) 


Note that J' k (Q k ) = ( dJ / dQ d fc t/ (l/) ) I = v ( Q k ), where v (Q k ) satisfies Q k {v ( Q k )) = Qk■ 

\ / / I v=u(Qu) 
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The above local optimization problem is still difficult to solve directly. An alternative method 
is simplifying the effect of C k on the other links as a linear functions Ifl 6 l . Define 77 /. as the 
marginal increase in the utility of the z-th flow per unit increase in C k , he., 




2 


TT ik 


where I ik = Eli ,& k II^ 


( 2 c fc-l) 


P + Iik + US, 


ik 


N 0 H-— 


2 °fc-l 


hk + IlSifeU ( Nq + 2 cl_ 1 


(23) 




and YJ = P Eli HifyH* + ^ 0 . 


Adopting the linear simplification tt ik C k for the effect of C k on the z- 1 h flow in the per-stage 
local optimization problem (l22l) . we have the Karush-Kuhn-Tucker (KKT) condition as 

g>V(Q) P\\S kk fY k 2 c k 


9Qk 


(2 c k-iy 


K 


Ikk + || Sj 


kk\ 


N o+^rzi 


P + Ikk + ||'Sfcfc||‘ i ( Nq + 


Y k 


2 °fe-l 


Y tt ik = 7. (24) 


By solving (l24l) . we obtain the optimal fronthaul capacity C k for the local optimization problem 


as 


C k = log 2 


V k 


+ Ck + \J^Vk P IVkCk + P~ ||S fcfc | 


y 2 
-t u 


2 (P + J fcfc + \\S kk \\ 2 No) {Ikk + ||s fcfc || 2 No) ’ 


(25) 


where ( k = 2I 2 kk + 2 I kk (P + 2 ||S fcfc || 2 N 0 - ||S fcfc || 2 Y k ) + ||S fcfc || 2 (2 ||S fcfc || 2 N 2 - PY k + 2 PN 0 - 
2 \\S kk f N 0 Y k ) and r, k = P " S K ^ Y > . 

Based on the above analysis, we propose a low-complexity fronthaul allocation algorithm 
launched at the beginning of each slot, which is described using pseudo codes as Algorithm 
1. We denote C' r h = (C'l, C'l, ■ • • , C'l) as the allocated fronthaul capacities in the n-th 
iteration. 

Although the per-stage problem (l20l) is not convex in general, the following lemma states that 
it is a convex problem for sufficiently small 5. 

Lemma 3 (Asymptotic Convexity): When 5 is sufficiently small, the objective in (l20l) is a 
concave function of C, and the problem (l 20 l) is a convex problem. 

Proof: Please refer to Appendix F. ■ 


6 We will show later that this simplification does not affect the convergence property. 
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Algorithm 1 Delay-Aware Fronthaul Allocation 
l: Initialize n = 0 and Cj^ = 0 ,Vk 

2 : repeat 

3: for all user k do 

4 : Calculate ' 1 based on according to (l25l) 

5: end for 

6 : n = n + 1 

7 : until The difference between and C <n+r> is below a given threshold 


According to Lemma [3j we provide the convergence property and asymptotic optimality of 
Algorithm 1 in the following theorem: 

Theorem 4 (Asymptotic Optimality): When 5 is sufficiently small, starting from any feasible 
initial point C^, Algorithm 1 converges to the optimal solution of the original Problem [Q 
Proof: Please refer to Appendix G. ■ 

V. Simulation 

In this section, we evaluate the performance of the proposed low-complexity delay-aware 
fronthaul allocation algorithm for C-RANs. For performance comparison, we adopt the following 
two baseline schemes. 

• Baseline 1 [Throughput-Optimal Fronthaul Allocation]: The throughput-optimal fron¬ 
thaul allocation algorithm determines the fronthaul capacities for maximizing the total data 
rate without considering the queueing information, which is similar to that in [O but with 
ZF processing. 

• Baseline 2 [Queue-Weighted Fronthaul Allocation]: The queue-weighted fronthaul 
allocation algorithm exploits both CSI and QSI for queue stability by Lyapunov drift ltT4l 
and solves the per-stage problem (1201) replacing -jyjy 1 with Q k IfTTI . 

In the simulation, the performance of the proposed fronthaul allocation algorithm is evaluated 
in a C-RAN cluster with seven cells. A single channel is considered, and one user over the 
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channel is located randomly in each cell, with radius 500m. Poisson data arrival is considered, 
with an average arrival rate X k for the k -th UE, which is uniformly distributed between [0, 2A] 
with mean A. The path gain is calculated as L k j = 15.3+37.6 log 10 d kv with the fading coefficient 
distributed as CA7(0, 1). The average transmit power is 23dBm and the noise power spectrum 
density is -174dBm/Hz. The system bandwidth is 10MHz and the duration of the decision slot is 
10ms. The weights 7 k are the same and f3 k = 1 for all k. For comparison, the delay performances 
of different schemes are evaluated with the same total fronthaul capacity by adjusting 7 *. For 
obtaining the average performance, we consider 20 random topologies, each of which has 100 
time slots. 

Fig. [3] shows the average delay versus the average arrival rate when the total fronthaul capacity 
is 350Mbps. For all algorithms, the average delay increases when the average traffic load 
increases. It can be observed that the proposed fronthaul allocation algorithm outperforms both 
baselines, which verifies the accuracy of the priority function approximation in the proposed 
algorithm. 

Fig. |4] shows the average delay versus the total fronthaul capacity when the average arrival rate 
is 30Mbps. The proposed fronthaul allocation algorithm also achieves better performance than 
the baseline schemes. When the total fronthaul capacity is small, the average delay decreases 
significantly with the increase of the total fronthaul capacity. In contrast, when the total fronthaul 
capacity is large, the change in the average delay is relatively small with adjustment of the total 
fronthaul capacity. 

Table U illustrates a comparison of the MATFAB computational time of the proposed solution, 
the baselines and the brute-force value iteration algorithm (U in one time slot. From the results, 
our proposed algorithm has much less complexity than the brute-force value iteration algorithm. 
The computational time of our proposed algorithm is close to those of Baselines 1 & 2, and 
the difference is due to the computation of the approximate priority function. Therefore, our 
proposed algorithm achieves significant performance gain compared to the baselines, with small 
computational complexity cost. 
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Figure 3. Performance comparison with different average arrival rates when C = 350Mbps 


Table I 

Comparison of the MATLAB computational time 


Algorithm 

Time 

Baselines 1 & 2 

Proposed Algorithm 

Brute-Force Value Iteration 

0.006s 

0.043s 

> 10 5 s 


VI. Conclusions 

In this paper, we propose a low-complexity delay-aware fronthaul allocation algorithm for the 
uplink in C-RANs. The delay-aware fronthaul allocation problem is formulated as an infinite 
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Figure 4. Performance comparison with different total fronthaul capacity when A = 30Mbps 


horizon average cost Markov decision process. To deal with the curse of dimensionality, we 
exploit the specific problem structure that the cross link path gain is usually weaker than the home 
cell path gain. Utilizing the perturbation analysis technique, we obtain a closed-form approximate 
priority function and the associated error bound. Based on the closed-form approximate priority 
function, we propose a low-complexity delay-aware fronthaul allocation algorithm, solving the 
per-stage optimization problem. The proposed solution is further shown to be asymptotically 
optimal for sufficiently small cross link path gains. The simulation results verify the accuracy of 
the priority function approximation and show that the proposed fronthaul allocation algorithm 
outperforms the baselines. 
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Appendix A 
Proof of Theorem □ 


Following Prop. 4.6.1 of [HO], the sufficient conditions for the optimality of Problem 1 are that 
( 9*, {V* (Q)}) solves the following Bellman equation: 


9*t + V* (v) = min 

n(x) 


c(Q,^(x))r + ^Pr [x , |x,^(x)]l / * (xO 

x! 


= min 

n(x) 


c(Q,n(x))r + EE Pr [Q'k.fiW] Pr [H']P*(x') 

Q' H' 


(26) 


and V* satisfies the condition in (fl3l) for all admissible policies fl. Then 9* = min L (f2(%)). 

D(x) 

Taking expectation w.r.t. H on both sides of (l26l) and denoting V* (Q) = E [V* (%) | Q], we 
obtain the equivalent Bellman equation in (fl2l) in Theorem [T] 


Appendix B 
Proof of Theorem [2] 

In the proof, we shall first establish the relationship between the equivalent Bellman equation 
in (fT2l) in Theorem [2] and the approximate Bellman equation in (l27l) in the following Lemma [U 
Then, we establish the relationship between the approximate Bellman equation in (l27l) in Lemma 
0] and the PDE in (fl4l) in Theorem [2] 

1. Relationship between the Equivalent Bellman and Approximate Bellman Equations 

We establish the following lemma on the approximate Bellman equation to simplify the 
equivalent Bellman equation in (fl2l) : 

Lemma 4 (Approximate Bellman Equation): For any given weights /3, if 

• there is a unique (6 1 *, {W* (Q)}) that satisfies the Bellman equation and transversality 
condition in Theorem [[] 

• there exist 9 and V (Q) of classfzl C 2 (Vi r )) that solve the following approximate Bellman 


7 /(x)(x is a XT-dimensional vector) is of class C 2 (R^) if the first and second order partial derivatives of /(x) w.r.t. each 
element of x are continuous when x € Rtf. 







21 


equation : 

e = e 


K 


mm 

n(x) L 


z{Q,n{x)) + Y^ 


dV (Q) 


A fc -i2 fc (H,n(x)) 


Q 


, VQ g Q 


(27) 


d ® k 

and for all admissible control policies 0, the transversality condition in (fl3l) is satisfied for 
V, 

then, we have 


e* = e + o( 1), 1/*(Q) = ^(Q) + o(l), VQ G Q, 


(28) 


where the error term o(l) asymptotically goes to zero for sufficiently small slot duration r. ■ 
Proof of Lemma Let Q' = (Q \, • • • , C/ fc ) = Q(t +1) and Q = (Q 1 , • • • , Q k ) = Q (t). For 
the queue dynamics in © and sufficiently small r, we have Q' k = Qk~Rk (H, C) r + A k r, (V/c). 
Therefore, if V (Q) is of class C 2 (M+), we have the following Taylor expansion on V (Q'): 


K 


E [v (Q') |Q] =WQ) + £ 


k =1 


dV(Q) 

dQk 


A/ 


-E[i2 fc (H,n( X ))|Q])r + o(r). 


(29) 


For notation convenience, let F x (9, V, fl(x)) denote the Bellman operator. 


K 


F x (e, v. o(x)) = J2 (a» - o(*))) - e + c (Q, n ( x )) + vG x <y, si ( x )) 


k= 1 


dQ 


(30) 


for some smooth function G x and v = o(l) (w.r.t. r). Denote F X (9,V) = 
minn(Q) F x (9, V, fi(x)). Suppose (0*,V*) satisfies the Bellman equation in (fl2l) . we have 
E [F x (6**, V*) |Q] = 0, VQ G Q. Similarly, if (9, V) satisfies the approximate Bellman 
equation in (1271) . we have 


E[Ft(0,f/)|Q] =0, VQ eQ, (31) 

where FJ (9,V) = mm n(Q) F^{9,V,Q{x)) and F^{9,V,n{x)) = F x {9,V,Q(x)) ~ 

vG x (V, f2(x))- We then establish the following lemma. 

Lemma 5: If (9, V ) satisfies the approximate Bellman equation in ([27b . then 
|E[F x (0, 7)|Q] | = o(l) for any Q G Q. ■ 

Proof of Lemma [5} For any %, we have F X (9,V) = miiij-j( x ) [F^(0, V, f2(x)) + 
uG x ( v , n(x))] > min n(x) F^(9, V, Q(x)) + u min n(x) G X (V, f2(x)). Besides this, F x {9, V) < 















22 


min n(x ) F^{9, V, fl(x)) + vG x (V, where f2 f = argminn (x) F^(6, V, fl(x))- Since 

E[minn( x ) F X (Q, V, f2(x))|Q] = 0 according to (I3TT) . and F £ and G x are all smooth and 
bounded functions, we have |E[F x (0, V)|Q] | = o(l) (w.r.t. r). ■ 

We establish the following lemma to prove Lemma ® 

Lemma 6: Suppose E [F x (9*, V*) |Q] = 0 for all Q together with the transversality condition 
in (fl3l) has a unique solution (6*,V*). If (Q,V) satisfies the approximate Bellman equation in 
(ITTI) and the transversality condition in (fl3l) . then 9 = 9* + o (1), V (Q) = V* (Q) + o (1) for 
all Q, where o(l) asymptotically goes to zero as r goes to zero. ■ 

Proof of Lemma \6\ Suppose for some Q 7 , V (Q 7 ) = V* (Q 7 ) + O (1) (w.r.t. r). From 
Lemma[5] we have |E[F x (0, E)|Q] | = o(l) (w.r.t. r). Letting r — y 0, we have E [F x (9, E)|Q] = 
0 for all Q and the transversality condition in (fl3l) . However, V (Q 7 ) f V* (Q 7 ) due to V (Q 7 ) = 
V* (Q 7 ) + C> (1). This contradicts the condition that ( 9 *, V*) is a unique solution of F x (9*, V*) = 
0 for all Q and the transversality condition in (fT3l) . Hence, we must have V (Q) = V* (Q ) + o (1) 
for all Q. Similarly, we can establish 9 = 9* + o(l). ■ 


2. Relationship between the Approximate Bellman Equation and the PDE 
For notation convenience, we write J (Q) in place of J(Q;<5). It can be observed that if 
(c°°,{J(Q)}) satisfies (fT4l) . it also satisfies (ITTI) . Furthermore, since J (Q) = 0(J^k=i Qk)> 
then lim^oo [J (Q(i))] < oo for any admissible policy f2. Hence, J (Q) = Off2^ =1 Q 2 k ) 
satisfies the transversality condition in (fl3l) . Next, we show that the optimal policy Q J * obtained 
from (fT4l) is an admissible control policy according to Definition [2j 

Define a Lyapunov function as L( Q ) = J (Q). We define the conditional queue drift as 
A(Q) = E n ' J [ 'Tl!k,= \ (Qk(t + 1) — Qk(t)) |Q(t) = Q] and the conditional Lyapunov drift as 
AL(Q) = E° 7 * [L(Q(f + 1)) — L(Q(f))|Q(t) = Q]. We first have the following relationship 


between A(Q) and AL(Q): 


AL(Q) >E 


ci J 


K 


E «»(* +1) - QM) 


k= 1 


OQh 


Q (t) = Q 


(a) 

> A(Q) 


(32) 


if at least one of {Qk : V/c} is sufficiently large, where (a) is due to the condition that 


dJ( Q) 

9Qk 


IS 
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an increasing function of all Q k . 

Since (Ai,..., A#) is strictly interior to the stability region A, there exists A = (Ai + 
Ki,...,A k + kk) G A for some positive n = {r k : V/c} lH4ll . From Corollary 1 of |[T8ll . 
there exists a stationary randomized QSI-independent policy f2 such that 

K 

bkC k \Q(t) = Q] = C(k) (33) 

k =1 


C)|Q(f) = Q] > A fc + V/c, 


(34) 


where C(n) is the minimum time-averaging total fronthaul capacity for the system stability 
when the arrival rate is A. The Lyapunov drift AL(Q) is given by 

" K 


AL(Q) + E : 


n J 


ik c k r 


Q (t) = Q 


k= 1 


K 


K 


k =1 
K 


<9L(Q) fjj 

<E^+e« 


E ( ' itClT - C)r 


fc=l 


- 


0Q k 

dL{ Q) 
dQk 


-Rfc(H, C)t 


Q(f) = Q 
Q(f) = Q 


(35) 


if at least one of {Qfc : V/c} is sufficiently large, where ( b ) is because f2 J * achieves the 
minimum of (fl4l) and (c) is due to (l33l) and (l34l) . Combining (l35l) with (l32l) . we have 
A(Q) < AL(Q) < — Yhk =i 9 dof KT 4" C(k)t < 0 if at least one of {Q k : V/c} is sufficiently 
large. Therefore, E[A k — G k (li. fl J * W)|Q] < 0 when Q k > Q k for some large Q k . Let 
0fc(r, Q) = In (E[e(" 4 * !-G '^ H ’ ri A)))' |qJ} be the semi-invariant moment generating function of 
A k -G k ( H. Then, </> fc (r, Q) will have a unique positive root r k (Q) (f k (r* k ( Q), Q) = 0) 

[fT9ll . Let r* k = rj£(Q), where Q = (Q 1} ... ,Q K ). Using the Kingman bound Ifl9ll result that 
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F k (x) = Pr [Q k > x] < e r * x , if x > x k for sufficiently large x k , we have 


E n- '[J(Q)] 


K K r ,.oo 

<c’E E!!J 'K] = C E / Pr K> s l 

fe=l fc=l 


ds 


A' 


sc-E 

fc=i 

K 

^E 




F fc (s 1/2 )ds + / F fc (s 1/2 )ds 


/t 2 


fc=l 


-rfs 1 / 2 


+ / e _rfeS ds 


< oo 


(36) 


for some constant C. Therefore, is an admissible control policy and we have V (Q) = J (Q) 

and 9 = c°°. 

Combining Lemma |4l we have V* (Q) = J (Q) + o(l) and 9* = c°° + o(l) for sufficiently 
small t. 


Appendix C 
Proof of Lemma [H 


The coupling among the K uplink data queues is induced by S(t) in the expression of R k in 
©. According to Assumption [2l S(t) = The time index t is omitted in this proof for 

simplicity of expression. We adopt the adjoint matrix to obtain the inverse of the channel matrix 
H as 




M u 

Ml 2 • 

Mix 

det(H) adj(H) 

1 

M'2\ 

M 22 

M 2 r 

H 

M K i 

m K2 ■ 

■ M K k 


(37) 


where M k j is the (k,j) algebraic cofactor, which is the determinant of the submatrix formed by 
deleting the k -th row and j-th column of H multiplied by (—l) fc+J . 
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With 5 = max { L k] : dk ^ j}, we can rewrite the channel matrix H as 


0(1) 

0(y/5) 

■ o(Vs) 

o(Vs) 

0(1) 

■ 0(V5) 

0(y/5) 

o{Vs) 

■ 0(1) 


(38) 


where the K diagonal entries are 0( 1) and the other entries are 0(y/8). 

If k 7 ^ j, the submatrix formed by deleting the A -th row and j-th column of H includes K — 2 
diagonal entries of H, i.e., 0(1). As a result, when calculating the determinant of the submatrix, 
each term of the determinant is the product of K — 1 entries and at least one 0(\f5) term is 
included. Therefore, we obtain the coupling intensity ||5' fe j(f )|| 2 = 0(5) ,V7c ^ j, and Lemma 
HI holds. 


Appendix D 
Proof of Lemma [2] 


We first prove that J (Q; 0) = J2k=i Tfc (Qk)- The PDE in (fl4l) for the base system is 

r K 


E 


mm 

o(x) 


k =1 


^ ^ V + lkCk + 


A 


dJ (Q; 0) 

dQ k 


X k -R k (H.C) 


Q 


- c°° = 0. 


(39) 


We have the following lemma to prove the decomposable structures of J (Q; 0) and c°° in 
Lemma 7 (Decomposed Optimality Equation): Suppose there exist c“ and J k (Qk) £ 
C 2 (M + ) that solve the following per-flow optimality equation (PFOE): 


E 


mm 

c k > o 


^ + lkC k + J' k (Q k ) (\ k - R° k (H kk , C k ) 


Q k 


C^ = 0 , 


(40) 


where R° k (H kk ,C k ) = log 2 ^1 + 


P\\H kk \\ 2 

N 0 +N k 


and N k = 


_ P\\H kk \\ 2 +N 0 


7Ek-\ 


Then, J (Q; 0) = 


Ef= 1 J k (Qk) and c°° = Ylk=\ c “ satisfy OH). 


K oo 


Lemma [7] can be proved using the fact that the dynamics of the K queues at the UEs are 
decoupled when 5 = 0. The details are omitted for conciseness. 

Next, we solve the optimization problem in (l40l) . The optimal fronthaul capacity Cl from (l40l) 
is given by 

'PWH kk \\ 2 f J' k (Q k ) 


C* k = log 2 


Nn 


- 1 


Ik 


+ 


(41) 
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Substituting the optimal allocated fronthaul capacity Cl into (l40l) . and using the fact that ||///./. || 2 
follows a negative exponential distribution with mean L kk according to Assumption Q] we 
calculate the expectations in (l40l) as follows: 

if 4(Qk) > 7fe, the expected fronthaul capacity is 


E bkC* k \Q k ] 


' N 0~tk _ 

PL kk( J 'k(Qk)~lk) 


log 2 


/ PL kk x 

V N 0 


7fc E f N o7k 

In2 '\PL kk (J' k (Q k )- lk ) 


J'k(Qk) 

7 k 



(42) 


where E\(z) = f^° f —^dt is the exponential integral function. Otherwise, E [77,-Q:| Q k ] = 0. 
Similarly, if J' k (Qk) > 7 k, the expected data rate is 


E [K(«k k , Cl) |Q*] = r „ l»g 2 ( i 1 + PLt f/ Af ° | e ~ x dx 

N 0 

e PL ^ E ( N 0 J' k (Q k ) \ 

In 2 l \PL kk (J' k (Q k )- lk )j- 

Otherwise, E [R° k (H kk ,Cl)\Q k ] = 0. 

We then calculate c£°. Since (1401) should hold when Q k = 0, we have 


(43) 


c? = E [-, k Ct\Q k = 0] 


(44) 


E [R k (H kk , C k ) \Q k — 0] — X k . 


(45) 


Using (l42l) and (1431) . we can calculate c£° as shown in Lemma |2] Substituting (1421) . (1431) . and 
c£° into (l40l) and letting a k = we have the following ODE: 


n Qk 7fc f Q’k'Yk \ 

Pk T k M2 1 \J' k {Q k )-7k) 


+ J'k (Qk) X k 




J'k (Qk) ^E, 


f O'kJkiQk) A 

\J' k (Qk) - ik) 


0. 

(46) 


According to Section 0.1.7.3 of GDI , we can obtain the parametric solution of (1461) . as shown 
in (fT7T) in Lemma [2] 
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Appendix E 
Proof of Theorem [3] 

Taking the first order Taylor expansion of the L.H.S. of the Bellman equation in (fl4l) at L V] = 0 
(VI 7 ^ j), C k = C%, where C'l minimizes the L.H.S. of (l40l) . and using parametric optimization 
analysis [[ 211 . we have the following result regarding the approximation error: 

K K 

J(Q; 5 )-J(Q; 0 ) = J 2 E L ‘jMQ) + °(P). ( 47 > 

i=1 j= 

where J^(Q) captures the coupling terms in J (Q) satisfying 


K 

( 

■ / 

E 

X k — E 

log 2 

k =1 

V 

. V 


PL 


kk 


E 


Hij 


In 2 


1 + 


J'XQi) 

No+Nf 


H, 


kk 


! \ 


N 0 + A* 




Q 


\ 


J / 


dJjj (Q) 

9Q k 


K 

E 


A 0 + Af 


K 

E 

k=l,k^i,j I 1 -)- 


Nq+N* 

+ PLiiWHiiW' 2 l=1 ’ l C L h 

Jfc(Qfc) 


Hu 


(48) 


No +N* 


N 0 + A* 


\ 


N 0 +N; 


P Lkk Hkk 


k \L 
— i I Au 


H 


n 


Q 


= e. 


iji 


with boundary condition ./ y - (Q) \ Q =Q = 0 or ,J t j (Q) \ Q . =0 = 0, where Aj! = +N ° 


2 k —1 


and 6ij = jjf- is constant (where we treat 0 as a function of {L,. ; : Mi ^ j}). According to 


'*? ~ dL 


we have 


E 


/ 


log 2 


PL 


1 + 


kk 


II, 


kk 


! \ 


V 


No + m 




Q 


e ak Ei (a k ) 
In 2 


0 ( 1 ) 


(49) 


Then, we calculate the second term in (1481) and each part is calculated as follows: 

Q owm) 






E 

N 0 +N* 

Q 

=E 


■k* La Ha 




P La 

Hu 

2 

(n 0 + P La 

Hu 

2 )iV 0 


(50) 


4 11 


a i e ai E 1 (cii) 


An 


O (Qi 


E 


N 0 + N* 


Ljj 


Hjj 


Q 


2 An 


-E 




Hjj 


Q 


0(1) = ^0(1) 


(51) 
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Substituting these calculation results into (1481) . we rewrite the PDE as 

EA 


k =1 


+ 


_ e afc ^i dJij (Q) 


In 2 


A 


A, In 2 

K 

E 






i- ^ M ) E ^oie,) 

^ V 0 / 7 “T/. L ll 

' i=i,i^i 


(52) 


A- 


At In 2 


?Eo Wt) = o. 


An 


Ai 


k=l,k^i,j 

Using 3.8.2.1 of lf22l and taking into account the boundary conditions, we obtain that 


I ' 1 


Jij (Q) 


a,i e a iE\(aj) 

N 0 


K 

E . 

1 = 1 , 


v Ao 

2 ^ L u 


e ai Ei ( di ) — A* In 2 


-ofea + E 1 E 


K Pk (~\ — a kZ ak Ei(a k ) \ 
A 0 ^ \ k ^ No ) 


M k=l,k^i,j 


e ak E 1 (a k ) - Afc In 2 


^ (Qk) ■ 


(53) 


Substituting it into (1471) and exchanging the indices i and k, we obtain the first order perturbation 
(1181) in Theorem [3l 


Appendix F 
Proof of Lemma [3] 

We adopt the following argument to prove the convexity ll23ll : given two feasible points xi 
and x 2 , define g(t ) = /(txi + (1 — t)x 2 ), 0 < t < 1, then /(x) is a convex function of x if and 
only if g(t ) is a convex function of t, which is equivalent to cI d ^A > 0 for 0 < t < 1. To use 
this argument, we rewrite problem (l20l) as 

Rk{ H,C)V (54) 

Consider the convex combination of two feasible solutions, = {C k : V&;} and = 
{C'- 2) : Vfc}}, as C c = {Cl = tC ( k l> + (1 — t)C^ : Vfc} and 0 < t < 1. When 5 is sufficiently 
small, the second order derivative of / (C c , 5) is calculated as 

d 2 f(C‘,5) A/aP(Q) J-P 2 - 2 PZ k (^X j Y j \\S k ,\\ 2 (cf-Cf) 

* 2 " \(P + Z k f Z 2 (X, - l)^ 

P « (Xf + X,) Y s ||5fcj|| 2 (Cf - C< 2 ’) 2 
(P + Z k ) Z k (X, - 1)3 



K 


min f (C, 6) = 7aA 


fe=i 


^(Q) 

dQk 


( 55 ) 
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where X 3 = 2 tc i )+(1 f,c j J and Z k = ||S' fcj || 2 (A^ + Nj). When 5 is sufficiently small, the 

terms in the order of 0(5) can be ignored and we simplify d pc 


-.(2) 


d i 2 


as 


d 2 / (C c , 5) * ( PN o + ll^ll 2 ^ 2 ) ll^-ll 4 [CP - C, 


df 2 




d2) 

'k 


In 2 


(X k -l Y(p + z k fzl 


(56) 


fc=i 


It is obvious that 


d 2 /(C°,g) 
d t- 


> 0 in (1561) . and thus, the problem (l20l) is a convex optimization 


problem for sufficiently small 5. 


Appendix G 
Proof of Theorem [4] 

We prove the convergence by the fictitious game model [fl6l . We first construct the following 
capacity-price fictitious game model. The optimization problem of the fictitious capacity player 

k is 

.... r)V ro) 

(57) 


max u™ = 'X^-Rk (H. C) - lt C t + V 7 r ih C t . 

C ‘ i-tj* 


The optimization problem of the fictitious price player is 


max uf k c = - | 7 T ik - 

Ttik 


d 


dV(Q) 

dQi 


Ri( H,C) 


dC k 


(58) 


Each player in this game adopts the myopic best response (MBR) to update his strategy. From 
[243, the MBR updates converge to Nash Equilibrium in the supermodular games, in which the 
payoff function is supermodular in player i’s strategy and has increasing differences between 
any component of player k' s strategy and any component of any other player’s strategy. Now, 
we check if this fictitious game is supermodular. It is obvious that each player’s payoff function 
is supermodular in its own one-dimensional strategy. According to the method in IIT6l . we have 

du FW 

dc Z dlT - = 1 > 0,V7 A k ar| d the increasing difference condition is satisfied. Therefore, the 
fictitious game is a supermodular game and always converges. 

When 5 is sufficiently small, according to Lemma [3l the problem is convex and the 
supermodular game converges to the unique global optimal solution of the per-stage problem. 
Furthermore, the approximation error of the priority function in Theorem [3] approaches 0 with 
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sufficiently small <5. Therefore, the supermodular game converges to the optimal solution of 
Problem Q] with sufficiently small 5. 
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